Efficient HPSG Parsing Algorithm with Array Unification

Authors

  • Kenji Nishida
  • Kentaro Torisawa
  • Jun’ichi Tsujii
Abstract

This paper presents a method for improving the performance of HPSG parsers. The method extends Torisawa’s parsing method for HPSG, which uses a CFG compiled from a given HPSG-based grammar to predict the possible parse trees. Because this prediction reduces the amount of unification, parsing performance improves. However, we observed that the speedup achievable with this method is limited, because a compiled CFG cannot capture all the constraints in the original HPSG. We overcome this limitation by adding an extra mechanism, called array unification, to a parser that uses Torisawa’s method. The mechanism performs part of type unification and improves the precision of parse tree prediction with a small overhead. We observed that parsing speed improved by a factor of 2.8 with an existing Japanese grammar and by a factor of 1.7 with an existing English grammar.
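The abstract does not spell out how array unification works, so the following is only a minimal sketch of one plausible reading: each candidate carries a fixed-length array of types, and the arrays are unified slot by slot as a cheap filter before any full feature-structure unification. The tree-shaped type hierarchy, the type names, and the functions `unify_types` and `unify_arrays` are all assumptions made for illustration, not the authors' published algorithm.

```python
from typing import Dict, List, Optional

# Hypothetical tree-shaped type hierarchy: child -> parent ("top" is the root).
# Real HPSG type hierarchies are generally not trees; this is a simplification.
PARENT: Dict[str, str] = {
    "pos": "top", "noun": "pos", "verb": "pos",
    "vform": "top", "fin": "vform", "base": "vform",
}

def ancestors(t: str) -> List[str]:
    """Return t followed by all of its ancestors up to the root."""
    chain = [t]
    while t in PARENT:
        t = PARENT[t]
        chain.append(t)
    return chain

def unify_types(a: str, b: str) -> Optional[str]:
    """Unify two types in a tree hierarchy: the more specific one, or None on a clash."""
    if b in ancestors(a):
        return a
    if a in ancestors(b):
        return b
    return None

def unify_arrays(xs: List[str], ys: List[str]) -> Optional[List[str]]:
    """Element-wise type unification; fails as soon as one slot clashes."""
    if len(xs) != len(ys):
        raise ValueError("arrays must have the same length")
    result = []
    for x, y in zip(xs, ys):
        t = unify_types(x, y)
        if t is None:
            return None  # prune this candidate without full unification
        result.append(t)
    return result

# Assumed slot layout: [part of speech, verb form].
print(unify_arrays(["pos", "fin"], ["verb", "top"]))   # compatible
print(unify_arrays(["noun", "top"], ["verb", "fin"]))  # clash in slot 0
```

In this sketch a clash in any slot rejects the candidate parse tree early, which is the kind of extra pruning the abstract attributes to array unification. The cost is one type comparison per array slot.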

Related resources

Efficient Parsing with Large-Scale Unification Grammars

The efficiency problem in parsing with large-scale unification grammars, including implementations in the Head-driven Phrase Structure grammar (HPSG) framework, used to be a serious obstacle to their application in research and commercial settings. Over the past few years, however, significant progress in efficient processing has been achieved. Still, many of the proposed techniques were develo...

CuteForce - Deep Deterministic HPSG Parsing

We present a deterministic HPSG parser capable of processing text incrementally with very fast parsing times. Our system demonstrates an efficient data-driven approach that achieves a high level of precision. Through a series of experiments in different configurations, we evaluate our system and compare it to current state-of-the-art within the field, and show that high quality deterministic pa...

Lenient Default Unification for Robust Processing within Unification Based Grammar Formalisms

This paper describes new default unification, lenient default unification. It works efficiently, and gives more informative results because it maximizes the amount of information in the result, while other default unification maximizes it in the default. We also describe robust processing within the framework of HPSG. We extract grammar rules from the results of robust parsing using lenient def...

Towards efficient probabilistic HPSG parsing: integrating semantic and syntactic preference to guide the parsing

We present a framework for efficient parsing with probabilistic Head-driven Phrase Structure Grammars (HPSG). The parser can integrate semantic and syntactic preference into figures-of-merit (FOMs) with the equivalence class function during parsing, and reduce the search space by using the integrated FOMs. This paper presents a CKY algorithm with this function and experimental results of beam t...

Flexible Structural Analysis of Near-Meet-Semilattices for Typed Unification-Based Grammar Design

We present a new method for directly working with typed unification grammars in which type unification is not well-defined. This is often the case, as large-scale HPSG grammars now usually have type systems for which many pairs do not have least upper bounds. Our method yields a unification algorithm that compiles quickly and yet is nearly as fast during parsing as one that requires least upper...

Publication date: 1999